Learning to Discover Key Moments in Social Media Streams

نویسندگان

  • Cody Buntain
  • Jimmy J. Lin
  • Jennifer Golbeck
چکیده

This paper introduces LABurst, a general technique for identifying key moments, or moments of high impact, in social media streams without the need for domain-specific information or seed keywords. We leverage machine learning to model temporal patterns around bursts in Twitter’s unfiltered public sample stream and build a classifier to identify tokens experiencing these bursts. We show LABurst performs competitively with existing burst detection techniques while simultaneously providing insight into and detection of unanticipated moments. To demonstrate our approach’s potential, we compare two baseline event-detection algorithms with our language-agnostic algorithm to detect key moments across three major sporting competitions: 2013 World Series, 2014 Super Bowl, and 2014 World Cup. Our results show LABurst outperforms a time series analysis baseline and is competitive with a domainspecific baseline even though we operate without any domain knowledge. We then go further by transferring LABurst’s models learned in the sports domain to the task of identifying earthquakes in Japan and show our method detects large spikes in earthquake-related tokens within two minutes of the actual event.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Knowledge Management Approach to Discovering Influential Users in Social Media

A key step for success of marketer is to discover influential users who diffuse information and their followers have interest to this information and increase to diffuse information on social media. They can reduce the cost of advertising, increase sales and maximize diffusion of information.  A key problem is how to precisely identify the most influential users on social networks. In this pape...

متن کامل

Making sense of social media streams through semantics: A survey

Using semantic technologies for mining and intelligent information access to social media is a challenging, emerging research area. Traditional search methods are no longer able to address the more complex information seeking behaviour in media streams, which has evolved towards sense making, learning, investigation, and social search. Unlike carefully authored news text and longer web context,...

متن کامل

What Is New in Our City? A Framework for Event Extraction Using Social Media Posts

Post streams from public social media platforms such as Instagram and Twitter have become precious but noisy data sources to discover what is happening around us. In this paper, we focus on the problem of detecting and presenting local events in real time using social media content. We propose a novel framework for real-time city event detection and extraction. The proposed framework first appl...

متن کامل

Analysis of Media Consumption Behavior of Sports Fans with a Network Approach

Fans like to talk about their favorite team and players with others. Professional team fans use social media to learn more about teams, connect with other fans, follow teams and players, and build a fan community. Social media by creating a network of users has become a platform for researchers to study fan behavior. Given that members of the fan community interact with each other, their opinio...

متن کامل

Special issue on "Software architectures and systems for real time data stream analytics"

Sensors are now being embedded into almost everything around us. Our surroundings are generating massive and potentially infinite sequences of data streams mainly sourced from non-stationary distributions in highly dynamic environments. Such streams of big data are produced by an increasing number of widely adopted systems including social media platforms, surveillance systems, sensor networks,...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1508.00488  شماره 

صفحات  -

تاریخ انتشار 2015